Corpus: fra-pf_web_2016_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 95 96 96 96 97
1000 791 931 953 959 964
10000 5857 8339 9276 9521 9594
100000 28720 62509 84311 92674 95290
1000000 28721 62510 84312 92675 95291


Zipf's diagram for sentence endings


Gnuplot diagram

6608 msec needed at 2018-04-25 05:25